JOURNAL OF APPLIED BEHAVIOR ANALYSIS 


2004, 37, 457M68 


NUMBER 4 (winter 2004) 


THE EFFECTS OF CONDUCTING BEHAVIORAL 
OBSERVATIONS ON THE BEHAVIOR OF 
THE OBSERVER 

Alicia M. Alvero 

QUEENS COLLEGE, CITY UNIVERSITY OF NEW YORK 


AND 

John Austin 

WESTERN MICHIGAN UNIVERSITY 


Behavior-analytic approaches to occupational safety are often effective for improving safe- 
ty in organizations, and have been successful in a wide variety of settings. The effects of 
these safety processes are thought to arise primarily from the behavioral observation 
process and the delivery of feedback. Typically, supervisors or employee observers involved 
in behavioral safety implementations conduct observations. The present study was an 
attempt to assess the effects of conducting observations on an observer’s safety perfor- 
mance. An ABC multiple baseline counterbalanced across two sets of behaviors was 
conducted in a simulated office. The target behaviors involved knee and back positions 
during lifts; back, shoulder, and feet positions while sitting; neck and wrist positions 
while typing; and neck position during phone use. Substantial improvements in safety 
performance occurred after participants conducted observations on a videotape of a con- 
federate’s performance. The possible behavioral functions responsible for this change, and 
the implications of these findings for applied settings, are discussed. 
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In 2000 alone, a total of 5.3 million non- 
fatal injuries were reported in private indus- 
try workplaces; these injuries cost the coun- 
try nearly $21 billion per year (U.S. Bureau 
of Labor Statistics, 2001). Occupational ac- 
cidents have contributed to growing work- 
ers’ compensation premium payments, 
which totaled $57.1 billion in 1995 (U.S. 
Bureau of Labor Statistics, 1999). 

The behavior-analytic approach to occu- 
pational safety has proven to be effective in 
changing behavior and reducing injuries and 
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related costs (e.g., Fox, Hopkins, & Anger, 
1987; Sulzer-Azaroff, Loafman, Merante, & 
Hlavacek, 1990) and has demonstrated suc- 
cess across various settings and populations 
(e.g., increasing safety of roofing crews, Aus- 
tin, Kessler, Riccobono, & Bailey, 1996; in- 
creasing safety of industrial workers, Sulzer- 
Azaroff & de Santamaria, 1980; increasing 
safety of office personnel, McCann & Sulz- 
er-Azaroff, 1996). Prior research has identi- 
fied the principal components of an effective 
behavior-based safety process, which usually 
include assessment and identification of per- 
formance targets, development and imple- 
mentation of a behavioral observation pro- 
cess, review of observation data, and imple- 
mentation of behavioral feedback. The as- 
sessment involves a detailed review of an 
organization’s injury records, first-aid logs, 
interviews with employees, and a review of 
any other records that may provide useful 
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data concerning previous safety efforts (Sulz- 
er-Azaroff & Fellner, 1984). From this in- 
formation, performance targets (or at-risk 
behaviors) are identified and compiled into 
an observation checklist. 

The next step involves developing an ob- 
servation process and training employees to 
conduct safety observations 1 using the 
checklist. When conducting observations, 
observers approach other employees and ob- 
serve the behaviors listed on the checklist. 
Usually, the observed employee’s perfor- 
mance is scored on the checklist, and the 
observer delivers immediate verbal feedback. 
The data from the checklists are then con- 
verted into safety percentages that are re- 
viewed by either supervisors or employees or 
are graphed and posted in a prominent place 
for all employees to see. Frequent observa- 
tion and reviews of the data help the safety 
professionals to adapt the process as needed. 

The majority of previous research in this 
area has focused on the effects of feedback 
on organizational performance (see Alvero, 
Bucklin, & Austin, 2001, and Balcazar, 
Hopkins, & Suarez, 1985, for reviews of the 
feedback literature), but there are other in- 
dependent variables in the safety process that 
have been less adequately studied. One of 
these is the observation process itself. In the 
behavior-analytic approach to occupational 
safety research literature, researchers, and oc- 
casionally supervisors, conduct observations. 
However, in employee-driven programs (i.e., 
in real-world application), most or all em- 
ployees are trained to conduct behavioral ob- 
servations. Interestingly, the employees who 
conduct the observations are the same em- 
ployees who should engage in the safety tar- 


1 When a person conducts a safety observation as 
part of a behavioral safety process, he or she must 
observe and evaluate another person’s behavior and de- 
termine if it is safe or unsafe. Therefore, throughout 
this paper, “conducting observations” will refer to the 
observation and evaluation of another person’s safety 
performance. 


get behaviors on the checklist when they are, 
in turn, observed. Although behavior ana- 
lysts have closely examined and discussed the 
behavior of observers (e.g., Kazdin, 1977; 
Repp, Nieminen, Olinger, & Brusca, 1988), 
the focus of such discussion has been on ex- 
perimental observers (i.e., outsiders to the 
population under study). Because most ap- 
plied behavior analysis research describes 
programs implemented and evaluated by re- 
searchers (and not employees at the research 
site), questions focused on employee observ- 
ers who are part of the internal workings of 
the system have not been widely studied. 
Therefore, the purpose of the present study 
was to examine the effects of the observation 
process, specifically to determine if the pro- 
cess of conducting safety observations of tar- 
get behaviors at one point in time has an 
effect on the observer’s subsequent perfor- 
mance of those same behaviors. 

METHOD 

Participants, Setting, and Sessions 

Participants were 8 undergraduate univer- 
sity students (7 women and 1 man), and the 
study took place in simulated offices at a 
university campus. The offices were two ob- 
servation rooms equipped with corner- 
mounted video cameras, which were used to 
record all sessions. For the purpose of this 
study, each observation room was furnished 
with the following equipment to resemble an 
office workstation: a desk, two chairs, a com- 
puter, a telephone, and an empty cardboard 
box. All observation sessions were 15 min in 
length. 

Definition and Measurement of 
Dependent Variables 

All dependent variables, except lifting, 
were defined in terms of the percentage of 
intervals in which they occurred. Percentage 
occurrence was calculated by adding the 
number of intervals in which each safe be- 
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havior was observed, dividing it by the total 
number of observation intervals, and mul- 
tiplying by 100%. Safe lifting frequency was 
counted and reported as a percentage of total 
number of lifts. Each task was comprised of 
several target behaviors; for example, per- 
forming a lift required engaging in two tar- 
get behaviors: keeping the back aligned and 
bending at the knees. The eight target be- 
haviors and their definitions were as follows: 
When lifting or putting down an object, (a) 
the back should be straight in a natural up- 
right position throughout the lift, the back 
should not be twisted or parallel to the floor 
and (b) knees should be slightly bent (120° 
angle is recommended). When typing, (c) 
wrists should be in line with forearms and 
elbows, they should not be bent upwards or 
downwards, and (d) the neck should be up- 
right and aligned with the back, and eyes 
should be level with the screen and docu- 
ment. When sitting, (e) the back should be 
upright and parallel against the back of the 
chair (not leaning at an angle against it), (f) 
shoulders should be aligned with the back 
and not slouched forward, and (g) both feet 
should be flat on the floor (ball of foot and 
heel should touch the floor). When using 
the telephone, (h) the neck should be up- 
right and aligned with the back. Govern- 
ment ergonomic reports were reviewed to 
determine appropriate behaviors and their 
operational definitions (Occupational Safety 
and Health Administration, 1998; Office of 
Health and Safety Information System, 
1998). 

Each session was videotaped and scored at 
a later time by a trained observer using a 
checklist that contained each of the target 
behaviors and the definitions listed above. A 
momentary time-sampling procedure was 
used for data collection. Every 30 s, data 
were collected for behaviors occurring at that 
moment, with the exception of lifting. Tar- 
get behaviors were scored in a specified or- 
der. For example, if a participant was using 


the phone, the coder scored the four relevant 
target behaviors that occurred at the 30-s in- 
terval, in a specific order: back, shoulders, 
and feet (for sitting); and neck (while using 
the phone). Because lifting occurred so in- 
frequently, the chances were very high that 
it would not be detected using the time-sam- 
pling procedure. Therefore, data were col- 
lected for every lift, regardless of when it oc- 
curred. A behavior was scored as being safe 
when it satisfied the definition listed above. 

Interobserver Agreement 

A second observer independently scored 
40% of the sessions. An agreement was de- 
fined as an occurrence in which both ob- 
servers scored the same mark (safe or unsafe) 
for each particular behavior. Interobserver 
agreement was calculated as the number of 
agreements divided by the number of agree- 
ments plus disagreements multiplied by 
100%. Overall agreement between observers 
averaged 96.9% (range, 89% to 100%). 

Procedure 

Baseline. Before the start of the study, par- 
ticipants were randomly assigned to Group 
A or B. At the start of each baseline session, 
all participants were handed a list of instruc- 
tions and were asked to perform the tasks 
described in the instructions. The tasks were 
(a) typing a few paragraphs using a word 
processor on a computer; (b) dialing a phone 
number and leaving a brief message on an 
answering machine; and (c) picking up a 
cardboard box containing five pieces of pa- 
per, placing it on a chair, removing a specific 
piece of paper, and placing the box back 
down on the floor. Each task was repeated 
at least four times, and the order of tasks 
varied from session to session. 

Information phase. During the first session 
of the information phase, participants were 
informed about the nature of the study. All 
participants were told that the purpose of 
the study was to observe individual safety 
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behaviors in an office setting and were then 
given a handout containing definitions for 
four of the eight target behaviors and how 
to perform them safely. Group A received 
information on one set of four behaviors 
(back straight and knees bent when lifting, 
neck and wrist position while typing), and 
Group B received information on the other 
four behaviors (back, shoulder, and feet po- 
sition when sitting; neck alignment when us- 
ing the phone). Participants were given this 
information at the start of each session and 
were asked to study it for 5 min before they 
were handed the list of tasks to perform. The 
remainder of the session followed the same 
procedures as those during baseline. The 
purpose of this intervention phase was two- 
fold: to eliminate demand characteristics 
that are often displayed by participants tak- 
ing part in a laboratory study (Kazdin, 
1992) and to separate the effects of infor- 
mation from those of conducting safety ob- 
servations. Often participants in a laboratory 
study try to please the experimenter by en- 
gaging in the behaviors that they believe the 
experimenter is interested in observing; this 
is referred to as a demand characteristic. The 
information phase was created to minimize 
the effects of demand characteristics during 
the observation phase. 

Observation phase. Before being given a 
list of tasks to perform, participants were 
asked to observe a 5-min videotape of an 
experimental confederate performing tasks 
similar to those the participant performed 
during each session. Participants were then 
asked to collect data on the confederate’s 
safety performance using a checklist contain- 
ing the same four target behaviors they had 
been given during the information phase 
(Group A scored the first four; Group B 
scored the second four) and definitions on 
how to perform each safely. Participants 
scored the videotape using an event-record- 
ing procedure; thus, a behavior was scored 
immediately after observing its occurrence. 


Participants scored a different 5-min video- 
tape before every session during the obser- 
vation phase, and all videotapes included a 
random combination of safe and unsafe be- 
haviors. Immediately after scoring the vid- 
eotape, participants were instructed to per- 
form the tasks as in the baseline condition. 

Experimental Design 

The experimental design included com- 
ponents of a multielement and a multiple 
baseline design. Initially, a multielement de- 
sign was used to compare the two interven- 
tions (information and observation) with 
baseline. As indicated above, participants 
were randomly assigned to Group A or 
Group B. In accordance with a multielement 
design, the first four target behaviors were 
sequentially exposed to the information and 
observation conditions for Group A while 
the remaining four target behaviors re- 
mained in baseline. For Group B, the last 
four target behaviors were sequentially ex- 
posed to the information and observation 
conditions while the first four target behav- 
iors remained in baseline. In accordance 
with a multiple baseline design, once stable 
levels of target behaviors were obtained in 
the observation condition for half of the tar- 
get behaviors (i.e., the first four for Group 
A or the last four for Group B), the obser- 
vation condition was introduced for the re- 
maining target behaviors (i.e., the last four 
for Group A and the first four for Group B). 

Integrity of the Independent Variables 

Scripts were developed and used for all of 
the verbal instructions that were given to the 
participants. This ensured that all partici- 
pants were exposed to the same instructional 
set. During the information phase, partici- 
pants were required to check off each be- 
havior and safety definition on the infor- 
mation sheet and sign the sheet to ensure 
that they had been provided with informa- 
tion during each session throughout this 
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phase. The safety checklists used by the par- 
ticipants during the observation phase were 
collected to provide verification that partic- 
ipants conducted observations during the 
session. The videotapes shown to the partic- 
ipants were arranged in a specific order to 
ensure that all participants were exposed to 
the same sequence. 

RESULTS 

Figures 1 and 2 show the safety perfor- 
mance of the 4 participants in Group A, and 
Figures 3 and 4 show results for the 4 par- 
ticipants in Group B. Each figure has two 
panels per participant, one showing the re- 
sults for the first four target behaviors (e.g., 
Panel 1 in Figure 1 for Participant 1) and 
the other showing the results for the last four 
target behaviors (e.g., Panel 2 in Figure 1 for 
Participant 1). 

Across the 8 participants, there were 32 
opportunities for replication of target-behav- 
ior changes (four different behaviors for each 
of 8 participants) during the information 
phase. Safety performance remained un- 
changed from baseline for 25 of 32 oppor- 
tunities (78%). Temporary increases were 
observed on three occasions (9%) for two 
different behaviors: lifting with knees (Par- 
ticipants 2 and 3) and neck position while 
typing (Participant 4). These increases were 
followed by gradual decreases toward base- 
line performance. Strong effects occurred for 
one behavior (sitting with feet on the floor) 
for Participants 5, 6, 7, and 8 (13%). 

Across the 8 participants, there were 64 
opportunities for replication of target-behav- 
ior changes (eight different behaviors for 
each of 8 participants) during the observa- 
tion phase. No effects were observed for 4 
of 64 opportunities (6%). Of these four op- 
portunities, two failed to change because of 
a procedural shortcoming. Performance did 
not change for Participant 5 (Figure 3) on 
the two behaviors involved with lifting: back 


alignment and knee bend. The reason no 
improvements occurred in this case was that 
the participant never left her chair to lift, 
and therefore she could not lift properly. We 
never instructed her to stand when lifting, 
because that would have amounted to 
changing the experimental protocol in a sin- 
gle case. No other participants acted in this 
manner, nor did they receive special instruc- 
tion. 

Gradual improvements in performance 
were observed for 18 of 64 opportunities 
(28%); these occurred most frequently in 
typing and lifting tasks. A clear example of 
a gradual improvement can be seen in the 
lifting performance of Participant 1 (Figure 
1). Back alignment and correct knee bends 
increased gradually across each session dur- 
ing the observation phase. Both behaviors 
remained at 0% safe during the first session 
of the observation phase and gradually im- 
proved to 83% (back) and 100% (knees) 
safe during the last session of the phase. 

Of the 64 behaviors targeted across 8 par- 
ticipants, 42 (66%) increased substantially 
after the implementation of the observation 
phase. The most dramatic of these changes 
can be seen in the lifting performance of 
Participant 6 (Figure 3). Both back and knee 
behaviors averaged 0% safe during baseline 
and 100% safe during the observation phase. 

DISCUSSION 

The results of the current study suggest 
that improvements in performance occurred 
as a result of conducting observations of 
confederate performance. Using a checklist 
to observe and evaluate the safety perfor- 
mance of others increased the safety behav- 
ior of the observer. It is important to note 
that the increases in safety performance oc- 
curred as a result of observing and evaluating 
a confederate’s safety performance; observing 
alone did not change safety performance. 
The multiple baseline design used in this 
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Figure 1. The percentages of target responses completed correctly during baseline, the information condi- 
tion, and the observation condition for Participant 1 (top two panels) and Participant 2 (bottom two panels). 


study allows us to draw this conclusion. In- 
creases in safety performance occurred only 
on the behaviors targeted on the safety 
checklist. In other words, all eight target be- 


haviors were observed when a participant 
scored a videotape, but increases occurred 
only on the behaviors being observed and 
evaluated using the checklist. It is unclear, 
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Sessions 


Figure 2. The percentages of target responses completed correctly during baseline, the information condi- 
tion, and the observation condition for Participant 3 (top two panels) and Participant 4 (bottom two panels). 
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Figure 3. The percentages of target responses completed correctly during baseline, the information condi- 
tion, and the observation condition for Participant 5 (top two panels) and Participant 6 (bottom two panels). 
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Figure 4. The percentages of target responses completed correctly during baseline, the information condi- 
tion, and the observation condition for Participant 7 (top two panels) and Participant 8 (bottom two panels). 
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from a behavioral perspective, why these in- 
creases in the target behaviors occurred dur- 
ing the observation phase, and why they 
usually did not occur during the information 
phase. Another point of interest is why some 
behaviors increased immediately from 0% to 
100%, whereas others showed more gradual 
increases. 

Based on exit interviews with participants, 
we suspect that self-monitoring was a poten- 
tial causal factor involved in the changes that 
occurred during the observation phase. Par- 
ticipants almost uniformly reported that 
scoring the videotape caused them to begin 
to self-monitor during their own sessions. 

The current results add to the literature 
on safety performance in several ways. First, 
our literature search revealed no other at- 
tempts to test systematically the effects of 
conducting safety observations on observer 
behavior in the field of occupational safety. 
In addition, the effects of observing were ro- 
bust, suggesting that this may be a simple 
and efficient method of increasing the safety 
performance of employees. 

Performance data were collected for each 
participant and provided an important 
strength of the current study. Although fu- 
ture research is required to determine the be- 
havioral functions of conducting observa- 
tions, this study suggests that improvement 
in ergonomic safety does not negatively af- 
fect productivity. As another strength, the 
exit interviews provided important infor- 
mation concerning the possible causes of be- 
havior change. We are currently conducting 
studies to investigate empirically these po- 
tential sources of control. 

The laboratory setting used for the cur- 
rent research created both strengths and 
weaknesses. Participants were college stu- 
dents rather than actual employees. How- 
ever, although the participants created no or- 
ganizationally relevant products, it closely 
simulated jobs performed regularly by office 
personnel. A weakness of the study was that 


we failed to perform complete independent- 
variable integrity checks for information and 
observation. It was verified that participants 
were exposed to the appropriate form of in- 
formation and videotapes, and that partici- 
pants scored the videotapes as planned. 
However, we did not test the knowledge of 
the participants after information exposure, 
nor did we measure the accuracy of obser- 
vations during scoring. Future studies should 
attend to this deficiency accordingly. 

Although the current research suggests 
that observing the behavior of others chang- 
es the behavior of the observer, future re- 
search should build on this experiment by 
manipulating variables related to the effect. 
For example, future studies could examine 
the potential moderating influence of obser- 
vation accuracy, type and specificity of the 
behavioral checklist, and variation in prop- 
erties of the responses observed. 

If an observer effect exists in some or 
most cases of behavior-analytic approaches 
to occupational safety, practitioners may 
want to adjust methods of behavioral safety 
implementations to have all employees con- 
duct observations. This could change con- 
siderably the focus of consultant-based be- 
havioral safety implementations. Given a 
limited number of training days from con- 
sultants, safety managers may decide to de- 
vote more time to employee observation 
training than is currently the case. 
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STUDY QUESTIONS 


1 . Briefly describe the four principal components of an effective behavior-based safety process 
outlined by the authors. 


2. Which aspect of behavior-based safety processes has been the typical focus of previous re- 
search, and which aspect was examined in this investigation? 


3. 


Briefly describe the general experimental context and target behaviors. 


4. How were the target behaviors measured? 


5. Briefly describe each of the experimental conditions. 


6. Summarize the results obtained during the information and observation conditions of the 
study. 
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7. According to the authors, what behavioral process probably accounted for the observed 
changes in performance, and what evidence did they offer to support the influence of this 
process? 


8. The authors suggested that improvement in ergonomic safety does not affect productivity 
negatively. What data would support or refute this claim? 


Questions prepared by Jennifer N. Fritz and Sarah E. Bloom, University of Florida 


